Protein Structure Alignment in Subquadratic Time
نویسنده
چکیده
The problem of finding an optimal structural alignment for a pair of superimposed proteins is often amenable to the Smith-Waterman dynamic programming algorithm, which runs in time proportional to the product of the lengths of sequences being aligned. While the quadratic running time is acceptable for computing a single alignment of two, spatially “fixed”, structures, the time complexity becomes a bottleneck when running the Smith-Waterman routine multiple times in order to find an optimal pairwise superposition. We present a subquadratic running time algorithm capable of computing an alignment that optimizes one of the most widely used measures of protein structure similarity, defined as the number of pairs of residues in two proteins that can be superimposed under a predefined distance cutoff. The algorithm presented in this article can be used to significantly improve the speed-accuracy tradeoff in a number of popular protein structure alignment methods.
منابع مشابه
In Silico Analysis of Primary Sequence and Tertiary Structure of Lepidium Draba Peroxidase
Peroxidase enzymes are vastly applicable in industry and diagnosiss. Recently, we introduced a new kind of peroxidase gene from Lepidium draba (LDP). According to protein multiple sequence alignment results, LDP had 93% similarity and 88.96% identity with horseradish peroxidase C1A (HRP C1A). In the current study we employed in silico tools to determine, to which group of peroxidase enzymes LDP...
متن کاملAll - Against - All Sequence
In this paper we present an algorithm which attempts to align pairs of subsequences from a database of DNA sequences. The algorithm simulates the classical dynamic programming alignment algorithm over a digital index of the database. The running time of the algorithm is subquadratic on average with respect to the database size. A similar algorithm solves the approximate string matching problem ...
متن کاملSubtree Isomorphism Revisited
The Subtree Isomorphism problem asks whether a given tree is contained in another given tree. The problem is of fundamental importance and has been studied since the 1960s. For some variants, e.g., ordered trees, near-linear time algorithms are known, but for the general case truly subquadratic algorithms remain elusive. Our first result is a reduction from the Orthogonal Vectors problem to Sub...
متن کاملBioinformatics Analysis of Upstream Region and Protein Structure of Fungal Phytase Gene
Phytase increases the bioavailability of phytate phosphorus in seed-based animal feeds and reduces the phosphorus pollution of animal waste. Since most animal feeds for pellets are heated up to 65-80 °C, the production of a thermostable structure for phytase can be useful. In this study, we sought to perform bioinformatics analysis of the upstream region and protein structure of fungal phytase ...
متن کاملThe comparative analysis of the protease molecule structure of the Human lymphotropic virus type-1 (HTLV-1)
Background and Aims: Human lymphotropic virus type-1 (HTLV-1) causes various diseases such as adult T-cell leukemia/lymphoma (ATLL) and HTLV-1 associated myelopathy/tropical spastic paraperesis (HAM / TSP) in humans. The main goal of this study is to compare Iranian protease subtypes structure of this virus (HTLV-1) to samples collected from other part of world in order to understand their diff...
متن کامل